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Abstract 

Random sampling of large Markov matrices with a tunable spectral gap, a nonuniform 
stationary distribution, and a nondegenerate limiting empirical spectral distribution (ESD) is 
useful. Fix c > 0 and p > 0. Let A n be the adjacency matrix of a random graph following 
G (n,p/n), known as the Erdos-Renyi distribution. Add c/n to each entry of A n and then 
normalize its rows. It is shown that the resulting Markov matrix has the desired properties. 
Its ESD weakly converges in probability to a symmetric nondegenerate distribution, and its 
extremal eigenvalues, other than 1, fall in [—1/^/1 + c/k, — b\ U [b, 1 /-^/l + c/k\ for any 0 < b < 
1/vTTc, where k = [p\ + 1. Thus, for p £ (0,1), the spectral gap tends to 1 — l/vT+’c. 
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1 Introduction 

The spectral properties of random Markov matrices have received increasing attention over the 
years [4-6, 10, 24, 28]. In applications, it is useful to randomly sample a large Markov matrix, 
such that the mixing rate of the associated Markov chain is controllable. The chain can be used, 
for example, to evaluate the performance of a data analytic procedure under various strengths of 
statistical dependency within data [25]. By the well known connection between mixing rate and 
eigenvalues of Markov matrix [12, 26], the issue may be cast as how to sample large Markov matrices 
with a specified spectral gap. This note addresses the issue for reversible Markov matrices. 

Denote by M n the set ofnxn matrices with all entries being nonnegative. For M € A4 n , if 
its eigenvalues are Ai(M), ..., A n (M), counting multiplicity, then its spectral radius is g(M) = 
max | Aj(M) | and its empirical spectral distribution (ESD) is 

n 

HM = n~ l X hi(M), 
i =1 

where 5 X is the probability measure concentrated at x. By Perron-Frobenius theorem ([16], p. 534), 
g(M) is an eigenvalue of M. If Ml n = l n , where l n is the column vector of n l’s, then M is called 
a Markov matrix and g(M) = 1. Letting A n (M) = g(M), A*(M) = maxj <n |A,(M)| and 1 — A*(M) 
are known as the second largest absolute eigenvalue and the spectral gap of M, respectively. For 
X £ -A4 n , if all the entries of a := Xl n are positive, then its row-normalized version refers to the 
Markov matrix M = D~ 1 X, where D a denotes the diagonal matrix whose diagonal equals a. If A 
is symmetric, then the Markov chain with transition matrix M and initial distribution it = a/l' n a 
is stationary and has the same distribution as its time reversal, and for this reason M is called 

_1 /O _1 /O 1 /o 

reversible relative to it. Moreover, all A i(M) are real as M is similar to D a ' MD a ' , where D a 
denotes any symmetric matrix whose square equals D a . 

Let X n G AT n be symmetric random matrices with positive entries almost surely (a.s.). Let 
M n be its row-normalized version. Suppose the diagonal and upper diagonal entries of X n are i.i.d. 
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~ v n . If u n = v for all n, then by [4], provided that the 4th moment of v is finite, A*(M n ) —>• 0 a.s. 
as n —>• oo. On the other hand, if v is in the domain of attraction of a stable law of index in (0, 2), 
then by [5], A*(M n ) —>• 1 a.s. In either case, the spectral gap of M n cannot be tuned. The results 
suggest that, in order for the spectral gap or, equivalently, A*(M n ) to be tunable, the (marginal) 
distribution of the entries of X n needs to change according to n. 

Indeed, there are simple solutions along this line. Given n > 5, randomly pick four different 
numbers k,l,s and t from 1, ..., n. Let A n = (e^) € M. n with = l{{z,j} = {k,l} or {s,t}}, 
where 1{-} is the indicator function; A n is the adjacency matrix of a graph on n vertices with 
only two edges. Given c > 0, let M n be the row-normalized version of cJ n + A n , with J n € Ai n a 
matrix of l/n’s. As det(z — M n ) = [z+ 1/(1 + c)] 2 z n ~ 5 [z — (1 — 4/n)/(l + c)}\z — 1/(1 +c)](z — 1), 
A*(M n ) = 1/(1 + c), so it can be set at any value in (0, 1). 

The main problem with the example is that M n has few features. It is nearly the transition 
matrix of a chain of i.i.d. random variables uniformly taking n values. The lack of features is also 
reflected in the ESD of M n , which converges to 5q as n —>• oo. Despite this, the example shows 
that it is possible to tune the spectral gap by using sparse random graphs. In general, let A n be 
the adjacency matrix of a random graph. Define the row-normalized version of cJ n + A n as 

M n = D~ 1 (cJ n +A n ) with D n = D cln+An i n . (1) 

Although A n can be highly reducible, M n is always irreducible and aperiodic and so A*(M n ) < 1. 
Since all the eigenvalues of M n are real, we always assume that they are sorted as 

-1 < Ai(M n ) < ... < A n _i(M n ) < A n (M n ) = 1. 

Then A*(M n ) = max(|Ai(M n )|, |A n _i(M n )|). We simply call M n reversible, as there is only one 
stationary distribution associated with it. The closely related matrix I n — D n 7 ( cJ n + A n )D n 7 
is known as a normalized Laplacian regularized by c. The effects of c on spectral clustering and 
concentration of the ESD have been studied in statistical machine learning [19, 21]. 

The close relation between random matrices and random graphs is well known; see [5, 7, 13, 
15, 17, 18, 20, 22, 23, 27-29] and references therein. In [22], it is shown that if A n is the adjacency 
matrix of a random graph following the uniform distribution G Ht d on the set of regular graphs 
on n vertices with fixed degree d > 2, then as n —> oo, /j>a„ weakly converges a.s. with limiting 
density f(x) = d(4d — 4 — x 2 ) 1 ^ 2 /[2ir(d 2 — x 2 )], where a + := max(a,0). By Weyl’s inequality, 
Aj(D“ 1 A n ) < A i{M n ) < Xi + i(D~ 1 A n ) for i < n (cf. (2)). Consequently, M n and D~ 1 A n = 
(c + d)~ 1 A n have the same limiting ESD density (c + d)f((c + d)x), whose support is the interval 
between ±2V d — l/(c + d). Thus A*(M n ) is asymptotically lower bounded by 2y/d — l/(c + d). On 
the other hand, by the above Weyl’s inequality and the fact that g(A n ) is less than the maximum 
row sum of A n ([16], p. 345-347), A*(M n ) < g(A n )/(c + d) < d/(c + d). In particular, when 
d = 2, in which case the graph consists of disjoint cycles, A*(M n ) —> 2/(c + 2) a.s. Also, under 
various distributions on regular multigraphs of fixed degree d that allow multiple edges and, in some 
cases, self-loops, for any fixed l, |Az(A„)| and A n _;(A n ) converge to 2 y/d — 1 in probability, yielding 
A*(M n ) —>• 2 yjd — l/(c + d) [14]. However, when d > 2, A*(M n ) cannot be arbitrarily tuned as it 
is asymptotically upper bounded by 2 yjd — 1/d < 1. Perhaps important, under any distribution 
on regular (multi)graphs, since M n is doubly Markov, i.e., M' n is Markov as well, the stationary 
distribution associated with M n is uniform. If one wishes to sample a large Markov matrix with a 
nonuniform stationary distribution, then a different random graph needs to be exploited. We also 
mention that for a uniformly sampled doubly Markov matrix, which is irreversible a.s., its limiting 
ESD is degenerate [24]. 
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We shall consider the row-normalized version M n of cJ n + A n with A n the adjacency matrix of 
a random graph following G (n,p/n), the distribution on graphs on n vertices such that each pair of 
vertices is connected by an edge with probability p/n, independently from the other pairs ([2], VII). 
It is easy to see that for large n, the stationary distribution associated with M n is nonuniform with 
high probability. We shall fix c > 0 and p > 0 when deriving the asymptotic spectral properties of 
M n . It is known that for both G n ,d and G(n,p/n), if d —>• oo and p —>• oo as n —>• oo, then the ESD 
of suitably scaled and centered A n tends to the semi-circle law [13, 28]. It is also known that when 
p > 1 is fixed, the adjacency matrix of the giant component of a G(n,p/n)-distributed graph has 
a spectral gap asymptotically equal to 0 [23]. However, these results provide no indication on the 
spectral properties of M n . 

For the rest of the note, denote 


t c = 1/y/l + c, c > 0 . 

One of the main results of the note is the following. 

Theorem 1 . Fix c > 0 and p > 0. For n > p, let A n be the adjacency matrix of a random graph 
following G (n,p/n). Let k = [p\ + 1. Fix l > 1 and 0 < b < r c . Then P{6 < A n -i{M n ) < 
T c / k and — r c /k < A i(M n ) < —b} —>• 1 as n —>• oo. 

Thus, roughly speaking, A*(M n ) asymptotically lies between r c and r c / k . In particular, if 
p € (0,1), then A*(M n ) —>• r c in probability. To prove Theorem 1, in Section 2, we show that A*(M n ) 
is asymptotically dominated by r c / k . Then, in Section 3, we show that p,M n weakly converges 
in probability to a symmetric nondegenerate distribution and characterize the moments of the 
limiting distribution in terms of a random walk on a Galton-Watson tree. The proof uses the 
local convergence of random graphs [5, 8 ]. In Section 4, we show that the essential supremum of 
the limiting distribution is r c , which together with the result in Section 2 proves Theorem 1. In 
this section we also report some numerical results which suggest that bounds for A*(M n ) are not 
tight, especially the upper bound when p is large. Finally, in Section 5, we provide a more explicit 
formula for the moments of the limit of pM n , using the standard moment method. Some of the 
results in previous sections can also be established by the method [ 11 ]. 

1.1 Notation 

Following [2], a (labeled) graph G has no multiple edges or self-loops, and all its edges are undi¬ 
rected. Denote by V(G) and E(G) the vertex set and edge set of G, respectively. Each e € E(G) 
is an unordered pair {u,v}, with u / v € V{G)\ u is called an endpoint of e, denoted u G e. 
When direction has to be taken into account, denote by (u, v) the directed edge starting at u 
and ending at v. The adjacency matrix of G is Aq = (s uv ) u ,v&G with e uv = 1 {{u, v} € E(G)}. 
Denote by |H| the cardinality of a set A. Denote |G| = |V(G')| and e(G) = \E(G)\, and refer to 
them as the order and size of G, respectively. For brevity, denote u € G if u G V{G). Denote by 
d(u,G) := |{e € E(G) : u € e}\ the degree of u € G. If G' is another graph, denote by G U G' 
the graph with vertex set V(G) U V(G') and edge set E(G) U E(G'), and denote G ~ G' if the 
two graphs are isomorphic ([ 2 ], p. 3). If v € G and v' € G and if there is a graph isomorphism 
a : G —>• G', such that a(v) = v', then (G,v) and (G',v') are called isomorphic rooted graphs 
(rooted with v and v', respectively). For a finite set I, denote by 1/ the column vector of l’s 
indexed by /. For k > 1, a path on I of length A: is a sequence v = (ui,..., Ufc+i) with Vi € / and 
Vi 7 ^ |-i for i < k', note the requirement that adjacent vfs be different. If v k+ \ = v\, then v is 

said to be closed. 
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For properties of G(n,a), see [2, 3]. For a G [0,1], denote by Bern(a) the Bernoulli distribution 
with mass a on 1. Denote by Po(p) the Poisson distribution with mean p > 0. The essential 
supremum of a measure v on M is ess sup v = sup{x : u(x,oo) > 0}. For M G M n and k > 0, 
denote by /3 k (M) the kth moment of pm, which equals (l/n)tr(M fc ) ([1], Eq. (1.3.2)). 


2 Upp er bound of spectral radius 


Fix c > 0 and p > 0. For n > p, let A n = A G with G ~ G (n,p/n). Define M n and D n by (1). The 
spectrum of M n is identical to that of D n ] ' 2 (cJ n + A n )D n . Since D n l ^ 2 J n D n l ' 2 is of rank one 
with the only nonzero eigenvalue being positive, by Weyl’s inequality ([16], Corollary 4.3.3) 

Ai(Z?“ 1 A n ) < \i(M n ) < X i+ i(D~ 1 A n ), 1 <i<n. (2) 

Consequently, to prove the bound involving r c / k in Theorem 1, i.e., given l > 1, P{A n ~i(M n ) < r c / k 
and A i(M n ) > —T c / k ) —> 1, it suffices to prove the following. 

Proposition 2. Let p > 0 and k = [p\ + 1. Then p{g(D~ 1 A n ) < T c / k } —>• 1 as n —>• oo. 

For graph G, denote 

Kg = K g (c ) = D~I v{g)+Ag1v{g) A g (3) 

and analogously K n = D~ l A n . Put q = q{K g ). By Perron-Frobenius theorem ([16], p. 534) 
q = A|g| (K g ) and if |G| > 1 and G is connected, then q > 0 and there is a vector / = (f(u)) u&G 
with all f u > 0, such that 


Kef = qf- 


(4) 


Denote by N(u) the neighborhood of u in G , i.e., the set of v € G with {u, u} G E(G). 

Lemma 3. Let G be connected with |G| > 1 and C / 0 be a subset ofV(G). Denote by h(u) the 
distance of u € G to C. Define u(u) = f{u)q h K). For i = 0, ±1, define Ni(u) = {v € N(u) : 
h[v) = h(u) + i} and di(u) = \Ni(u)\. Then 

q~ 1 '^2d 0 (u)ijo(u) + q~ 2 ^d_i(rt)w(u) = Y / i d o( u ) + d-\(u) + c]u(u). (5) 

Proof. Since Nq(u), N_(u), and N + i(u) partition N(u ), (4) can be written as 

Y /(«)+ /( u )+ /(^) = q[c + d(u,G)]f(u). 

v(E.N—i(u) 'PGATo(n) vEN+i (u) 

Multiplying both sides by q h ( u K l yields 

Y u{y) + q~ l Y u{v)+q ~ 2 Y u ( v ) = i c + d (u, G)]u(u). 

v£N—i(u) v£No(u) v£N+i(u) 

Take sum over u. Since v € N-i(u) u G N + \(v) and v G Nq(u) 4=> u G Nq(v), 

Y = u( y ) = Y d +i( v ) uj ( v ) 

u v&N-i(u) v u£N + i(u) V 



and likewise, 

X] X] =J2 d o(v)v(v), 

u v£N 0 (u) v 

Combining the equations and noticing d(u, G) 


Y ='52 d -i( v ) uj ( v )- 

u vGN + i(u) v 

do(u) + d-i(u) + d + i(u), (5) then follows. 


□ 
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Lemma 4. Let G be a connected graph. If G is a tree or a unicyclic graph, then 


q{K g ) < t c . 


( 6 ) 


Furthermore, if G is a unicyclic graph, then 

q(K g ) > (1 + c/2) _1 with “=” G is a cycle. 


(7) 


Proof. First, let G be a tree. If |G| = 1, then Kq = 0 and ( 6 ) is trivial. Let |G| > 2. Pick an 
arbitrary vertex 6 € G and let C = { 6 *}. It is easy to see that for any u € G, do(u) = 0 and 
d_i(u) = l{tt 6}. Then ( 6 ) follows from (5), which now takes the form 

= (1 + c)^w(V) + cu(6). (8) 

Next, let G be unicyclic. Let C be the cycle subgraph of G. Then \C\ > 3. The subgraph of G 
obtained by removing the edges in C consists of \C\ isolated trees, each containing exactly one 
vertex in C. It can be seen that do(u) = 21 {u € C} and d-\{u) = 1{« 0 C}. Then by (5), 

( 2 /<?) X w ( u ) + °l ~ 2 X = ( c + 2 ) X + ( c + X ) X 

ueC u£C ueC ugc 

If G is a cycle, then C = G and the equation yields q = 1/(1 + c/2). If G is not a cycle, then 
Ylugc u ( u ) >0- If Q < 1/(1 + c/2), then from 2/q > c+ 2 and YlueC u ( u ) > 0; it follows that 
c + 1 > q~ 2 , or q > t c > 1/(1 + c/2), which is a contradiction. Thus q > 1/(1 + c/2). But then 
2/q < c + 2, implying q~ 2 > c + 1, or q < t c . □ 

It may be worth noting that if G is a tree, then |G| —>• oo does not guarantee that q{Kq) —>• r c . 
For example, suppose d{y, G) < 1 + c for all v € G. Put do = |"c]. Then d{y, G) < do- Let / be as 
in (4) and 9 = argmax f{v). Then for k > 1, Ylh(u)=k UJ ( u ) — ^o<Z fe f{Q) < [do/(l + c )] fc o;(0), giving 
E U / 0 W (“) < with 6 = )CfcK)/(l + c)] k < oo. Then by (8), q y4 r c . 

Proof of Proposition 2. By definition, = D~ l A n = ifc with G ~ G (n,p/n). First, suppose 
0 < p < 1. Write the connected components of G as G*i,..., G s . Then Kq can be partitioned as 


K g 


(K Gi 

V 


\ 

KgJ 


The eigenvalues of Kq therefore are exactly those of K Gi , counting multiplicity. Since 0 < p < 1, 
P{all Gi are trees or unicyclic graphs} —» 1 as n —> oo ([3], Corollary 5.8). This combined with 
Lemma 4 yields P {g(K n ) < r c } —>• 1. 

To continue, note that given 0 < po < p\ < 1, as n —> oo, 


inf P p {o(K n ) < t c } -)• 1 , 


Po<p<pi 


(9) 


where P p denotes probability under G (n,p/n). Indeed, from the proof of Theorem 5.7 and Corollary 
5.8 in [3], as n —>■ oo, inf Po <p< Pl P{every component of G is a tree or a unicyclic graph} —» 1. Then 
(9) follows from the same argument for the already-proved case 0 < p < 1. 
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Now let p > 1. Then k := [p\ + 1 > 1. For n > p, let T\^ n , ..., Tk, n be i.i.d. ~ Aq with 
G ~ G (n,p' n /n), where 


Pn = 


P 


k — (k — 1 )p/n 


Since p' n € (0, n), G is well defined. Let T n = (tij) = Yls=i^,n- Since tij, i < j, are i.i.d., for any 
B = (bij) € {0, l} nxn with bij = bji and b tt = 0, 


P{T n = B | T n € {0, l} nxn } = 


P{Lj = hj, i < j} 


n 


P {bij — } 


P {tij G { 0 , 1 }, i < j} r~ P{% € {0,1}} ' 

i<.j 


Since P {tij = 0} = (1 — p' n /n) k and P {tij = 1} = k(p' n /n)( 1 — p' n /n) k , direct calculation 
shows that conditional on it being in {0, l} nxn , T n has the same distribution as A n . For i < j, 
P {t^ € {0,1}} > 1 — [k(k — l)/2 }{p' n /n) 2 . On the other hand, p' n —> p/k as n —>• oo. Then for n 
large enough, P{T n € {0, l} n } > exp(-p 2 ), so letting A n = D c i n +T n i n and C = exp (p 2 ), for any x, 


P {g(K n ) >x} = P{^(A- 1 T n ) > x | T n g {0, \} nxn ) 
<cp{e(A„-% n A^ )>3:} . 

Put A s n — ^ci n /k+T Sin i n and B s n — A s , n T s n /\ s n . Then A ra — A \ tn T ■ ■ ■ T A n and 

- 1/2 


( 10 ) 


A” 1/2 T A “ x /2 _ V A“ 1 / 2 A 1/2 b a 1/2 a 

- L n L -*n ~ / J n L -*s,n JD s,n L -±s,n LX 
s 

Fix an arbitrary a € (p/k, 1). For n large enough, p' n € (p/k, a}. Then by (9), the probability 
of the event that g(B S)n ) < r c /f. for all 1 < s < k tends to 1. On this event, for any u € M n , 

s 

s 

< T c/kY^ l A ^n A n 1/2 “| 2 = T c /k\u\ 2 . 

s 

It follows that P{ e (A" 1 / 2 T n A- 1/2 ) < r c/fc } ->• 1, so by (10), P{g(D~ 1 A n ) < r c / fc } ->• 1. 


□ 


3 Convergence of ESD 

Let A n , M n , D n , and K n = D~ l A n be as in previous sections. We shall show that pM n weakly 
converges as n —>• oo. From Weyl’s inequality (2), weakly converges in probability (resp. a.s.) 

p,K n does so in probability (resp. a.s.) and, provided the convergence holds, the two ESDs 
have the same limit. Therefore, we shall focus on px n instead. The approach we shall take is the 
local convergence of random graphs; see [8] and references therein, and [5] for extension to the ESD 
of random matrices whose entries belong to the domain of attraction of stable laws. 

Let G be a graph and vq € G. Fix c > 0. Consider the following random walk on G starting from 
vq at step 0. If d(v o, G) > 1, then if at step k > 0 the random walk is at v with d(v, G) = d(> 1), 
then at step k + 1 , it either moves to a neighbor of v with probability l/(c + d), or is killed with 
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probability c/(c + d). If d(vo,G) = 0, then the random walk is killed at step 1, regardless of the 
value of c. Let 


r k {G, vo, c ) = P{the random walk is alive and at vq at step k}. 

Let 0 be an arbitrary element. Denote by [(G, t>o)] the class of graphs rooted with 0 that are 
isomorphic to (G,v o). Then r k (G,vo,c) depends on {G,v o) only through [(G, vo)]. 

Recall that in order for K n = to be always well defined, c has to be strictly positive. 

In the following, we redefine D n such that its ith diagonal element is 1 if the entire ith row of A n 
is 0. With this definition, c can be 0. 

Theorem 5. Let c > 0. As n —>• oo, qi k„ weakly converges in probability. The weak convergence 
is a.s. if n is replaced with any subsequence nj with < oo. The limiting distribution is 

symmetric and nondegenerate, and for k > 1, its kth moment is /3k = ~Er k (T, 0, c), where T be a 
random Galton- Watson tree rooted with 0 and with Po(p) offspring distribution. 

Note that for any tree T, if k is odd, then r k (T, 0, c) = 0 and hence /3k = 0. This immediately 
leads to the symmetry of the limiting distribution. 

Proof. Put K n = ( Xij ). Denote by C k ,n the set of closed paths of length k on {1,... ,n}. For 
i G Ck,n, denote x(i) = 3 • • • Xi k _ 1 i k Xi k i 1 . Then for k > 1 and s = 1,..., n, the sth diagonal 

entry of Kjf is 

(K*)„ = + *(<)• 

Since x(i) is the probability that the random walk is alive after traversing the closed path i, 

{Kn)ss = r k (G,s,c), s = l,...,n. 

For a random walk on G that starts from s, if it returns to s at step k, then the vertices it visits 
by then each has at most distance k — 1 from s, and so the neighbors of each such vertex has at 
most distance k from s. Denote by G k , s the subgraph of G whose vertex set consists of vertices 
with distance from s no greater than k and whose edge set consists of edges in G connecting these 
vertices. Then r k (G, s, c) = rfc(Gfc >s , s, c ). It is well known that, given s, as n —>• oo, G rooted with 
s converges locally to T in distribution. This means that for any k, G k)S rooted with s converges 
in distribution to T k , the subtree of T consisting of 0 and its first k generations of descendants; see 
for example [8]. As a result, r k {G kl \, 1, c) —> rfc(Tfc,0,c) = r k (T, 0,c) in distribution. By the above 
displays, /3k(K n ) = n~ 1 Y^ 1 s=i r k(G,s,c). Then by exchangeability and dominated convergence, 
E/3 k{K n ) = E r k {G, 1,c) = Er fc (G M ,l,c) Er fe (T,0,c). 

We need to show that (3k{K n ) —>• (3 k in probability as n —>• oo, and a.s. if n is replaced with 
nj —> oo such that < 00 • Put = r k (G,s,c). By exchangeability, 

Var [/3 k (K n )] = n -1 Var(£i) + 2(1 - n -1 ) Cov(£i,£ 2 ) < u _1 + 2| Cov(£l,£ 2 )|- (H) 

Let S\ = ljdistance between 1 and 2 in G is > 2k}. Then 

2 fe —1 

P {Si = 0 } < ^2 P{3*ii ■ ■ • )* 2 fe-i s.t. {i t ,i t+ 1 } G E(G), 0 < t < 2k, with i 0 = 1 , i 2k = 2} 

1=0 
2k—1 

< n2k ~ 1 (p/ n ) 2k = O k (l/n), 

1=0 
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where O k (-) denotes that the implicit constant depends only on k in addition to the fixed p and c. 
Note that when S± = 1, G ki \ and G k ^ are disjoint. Let £2 = l{|Gfc ]S | < n/2,s = 1,2}. Denote by 
dis (u,v) = distance between u and v in G. By |Go, s | = \{u : dis(s,u) = 0}| = 1, 


E \{u : dis(s, u ) 


A:} | < E 


n 

£ £i{{u,»}e£(G)} 

v:dis(s,v)=h —1 u= 1 


< p E \{u : dis(s, u) 


k - 1}|, 


and induction, E|Gfc )S | < 1 + p + • ■ ■ + p k = O k (l). Then by Markov inequality, P {£2 = 0} = 
O k (l/n). Let £ = £]£ 2 . Then P{£ = 0} = O k (l/n). Conditioning on £ = 1, [(Gfc t i,l)] and 
[(Gfc,2,2)] are i.i.d. ~ [(G k , 1 ,1)] conditioning on |G fc) i| < n/2. Since for s = 1,2, = r k (G k , s ,s,c ) 
only depends on [(Gfe jS ,s)], Cov(£i,£ 2 [£ = 1) = 0. By exchangeability E(£i | £) = E (^2 I ‘S ')7 
denoted by hs- Then 

Cov(£i,6) = E[Cov(ei,61 £)] + Cov(e(6 | £),E(61 £)) 

= Cov(6,6 I £ = 0) p{£ = 0} + Var (h s ) 

= O k (l/n) + (hi - h 0 ) 2 P{£ = 0} P{£ = 1} = O fc (l/n), 

so by (11), V&r[f3 k (K n )] = Ofc(n _1 ). This implies that /3 k (K n ) — E [/3 k (K n )\ —>■ 0 in probability, so 
/ 3 k (K n ) —> /3 k in probability. Moreover, for nj — > 00 with X) 71 / 1 < °°5 by Borel-Cantelli lemma, 
Pk(K nj ) ~ E [l3 k (K nj )] 0 a.s., giving /3 k (K nj ) (3 k a.s. 

Since the entries of K n are nonnegative with row sums no greater than 1, g{D~ l A n ) < 1 and 
hence pK n is supported in [—1,1]. Meanwhile, by Weierstrass theorem, polynomials are dense 
in C{[— 1,1]). Then by the convergence in probability of /3 k (K n ) and standard results on weak 
convergence ([9], Section 8.4), hk„ weakly converges in probability to a probability distribution 
with support in [—1,1] and moments /3 k . Finally, for any rij 00 with Yl n J l < 00, the a.s. weak 
convergence of the ESD of K Ui follows from the a.s. convergence of f3 k (K nj ). □ 

4 Essential supremum of the limit of ESD 

Let poo be the limiting distribution of PK n , where again K n = D~ l A n . The main result of this 
section is the following. 

Theorem 6. For any p > 0, ess sup = t c . 

Thus, for fixed c > 0, as p —>• 00, ess sup does not vanish. This may be compared to the case 
where the underlying random graph follows G n) d- By [22], the corresponding essential supremum 
is 2 y/d — 1/(c + d ) so, given c > 0, it tends to 0 as d —> 00. 

Proof of Theorem 6 . Given s > 1 , the probability q s that T is a tree on { 0 ,vi,..., v s } with E(T) = 
{{05^}) 7 = 1 , ■ ■ ■, <s} is positive. For this T and k = 2m, it is easy to get r k (T,®,c) = [s/(c + 
s)] m [l/(c+ l)] m . Then by Theorem 5, f 3 k > g s [s/(c + s)] m [l/(c + l)] m , yielding esssup^oo > 
V s/(c + s ) t c . Letting s —>• 00 then gives ess sup //qo > t c . 

To show esssup/ioo < r c , it suffices to consider c > 0. For k = 2m, arguing as in the proof of 
Theorem 5, (3 k = E r k (T,$,c) = E r k (T m+ i, 0, c) = EWe claim that for any finite graph 
G, v € G, and k > 1, 

I (K k G ) vv \<Q(K G ) k . (12) 
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Figure 1: Boxplots of randomly sampled A*(M n ). 

Together with Lemma 4, this implies | {Kr m , )@01 < T k . As a result /3 k < T k and hence ess sup /ioo < 
r c , which completes the proof. 

To prove (12), suppose V(G) = {l,...,n}. Then Kq = D~ 1 Aq, where a = (oi,..., a n )' = 
cl n + Acl n - Let 6 = (y/ai,..., . Then Kq = D^ 1 B k Db, where B = D^ 1 AqD^ 1 . Since 

Df, is diagonal, ( Kq)h = (B k )u, i < n. Since B is symmetric, so is B k . In general, for any 
symmetric real-valued matrix H , since g(H) ±H is nonnegative definite, maxj \Ha\ < q{H). Thus 
\{Kq)h\ < g(B k ) = g(B) k = g(Kc) k , as claimed. □ 

We now can prove Theorem 1. Without loss of generality, let b € (0, r c ) be a continuity point of 
the distribution function of //qq. Then by Theorems 5-6, gK n (J) —>• Hoo(b, oo) > 0 in probability 
for J = (—oo,—6), (6,oo), and so for any l > 1, P{A;(A' n ) < —6 and A n _; + i(iL n ) > 6} —» 1. 
Together with Proposition 2, this yields 

P{6 < \ n -i + i{K n ) < r c / k and - r c / k < Xi(K n ) < -6} 1. 

Since the convergence holds for alH > 1, then by Weyl’s inequality (2), the proof is complete. 

We conducted a simulation study to examine the tightness of the bounds in Theorem 1. Given 
c, for each p € {0.5,1,1.5, 2, 2.5, 3}, we used MATLAB function eig to calculate A *(M n ) for 200 
randomly sampled M n with n = 4000. In each panel of Figure 1, the boxplot of the sample values 
of A*(M n ) is shown as a function p. On each box, the central mark is the sample median, the edges 
of the box are the 1st and 3rd sample quartiles, the whiskers extend to the most extreme sample 
values considered by MATLAB to be non-outliers, and the outliers are plotted individually as “x”. 
The y-coordinate of the long horizon line extending from p = 0.5 to 3 equals r c . As Figure 1 shows, 
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for p = 0.5 and most of p > 1, even when n = 4000, A*(M n ) is still quite below r c . Since A*(M n ) is 
asymptotically lower bounded by r c according to Theorem 6, this suggests that its convergence is 
slow. The plots also indicate that for different values of c, there are different values of p for which 
the convergence is fastest in terms of how fast A*(M„) approaches or goes above r c and how fast 
its variation decreases. Among all the pairs of c and p, only (c, p) = (0.5,1.5) and (1,2) generated 
a significant number of A*(M n ) that were no less than r c , with the fraction of such A*(M n ) equal 
to 22% and 20%, respectively. When n was increased to 6000, the fraction changed to 18% and 
22%, respectively. However, the differences in fraction are not statistically significant. To see if 
the relatively high fractions were due to fluctuations of the bulk of the eigenvalues, we counted 
the total number of eigenvalues with absolute values no less than r c . When n = 4000, for each 
pair, there were 200 x 4000 = 8 x 10 5 absolute eigenvalues in total. Only 246 and 243 of them, 
respectively, were no less than r c . When n = 6000, the counts changed to 239 and 246, respectively. 
Thus, the fluctuation of the bulk had little to do with the relatively high percentages of A*(M n ) 
greater than r c . 

We were unable to go beyond n = 6000 due to limited computing capacity. Nevertheless, the 
numerical results suggest that for p > 1, r c is not a tight lower bound, at least in the probabilistic 
sense that there is t' > r c , such that P{A*(M n ) > t'} 0. The numerical results also suggest, 

more convincingly, that the upper bound in Theorem 1 is far from being tight, especially for large 
p. It would be interesting to see whether A*(Af n ) has a nonrandom limit or weakly converges to a 
nondegenerate distribution and in either case, at what rate of convergence. 

5 A formula for moments of the limit of ESD 

This section gives a more explicit formula for the moments /5k of the limiting distribution 11 ^. 
To state the result, if v is a path on I n := {1,... ,ro}, denote by [u] the graph whose vertex set 
consists of the distinct elements among Uj, and whose edge set consists of the distinct unordered 
pairs among {vi, Uj + i}, 1 < i < k. Denote by Ck, n the set of closed paths of length k on I n . Denote 
by ro(-, v) the number of times an object appears in v. Thus, for u, v! £ I and e = {u, un(u, v ) = 
1 {vi = u} and n(e,v) = Ya=i H{ x i, x i+i} = e}. Also, denote n+(u,v) = Y. X £i n (( u > x )i v )’ 
i.e., the number of directed edges in v starting at u. Denote by E n the set of edges of the complete 
graph on I n . Following the definition on p. 17 of [1], a path i € Ck )U is called canonical if i\ = 1 
and ij < max(7],..., ij~i) + 1 for 2 < j < k. For such a path i, if | [i]| = t, then the set of distinct 
values of ij is {1,..., t}. Let 

Tfc.t = {* S Cfc,n : * is canonical, [i] is a tree of size t}. 

As long as n > t, the definition is independent of n. Note that for i £ Ck )n , [*] is a tree |[i]| = 
e([i]) + 1, and when this is the case, each e € E([i]) is traversed by i on both directions the same 
number of times, and hence k is even. Therefore, F^. t = 0 if k is odd. 

Proposition 7. Let c > 0. Then for even k = 2m, 


m+1 


t 



Pk=Yl pt ~ 

-E 

IP 

\c + d(a,[i ])+0- n+{a ’ i} ' 

, £ ~ Po(p). 

t=2 


a= 1 



Proof. Given n, write A n = 

faj) e 

{0, l} nxn . For e = (i, j} £ E ni denote e e = e t] 


w 

n 

= C ^ ^ Sij , Xij = Eij / Wi . 



j =1 
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For (yij) € M nxn and (zj) € M n , and for i € C k , n , denote y(i) = yi x i 2 yi 2 i z ■ ■ ■ yi k _^i h yi k ii and 
z(i) = Zi x - ■ ■ Zi k . Then from the proof of Theorem 5, 

E[tr(A'*)]= XI E [*(<)]. (13) 

i£Ck,n 

Given i € Ck,m let t = |[i]| and s = e([i]). Since s(i) = fl eeE([i\) with a^ n{e,i) > 1, 

x(i) = s(i)/w(i) / 0 s e = 1 for all e € E([i]) -<==>■ e(i) = 1. As e e , e € E([i]), are 
i.i.d. ~ Bern (p/n), P{e(i) = 1} = ( p/n) s . For j < k, w t] > c + £i jij+1 and {ij,ij+ 1 } € E([i]). 
Consequently, e(i) = 1 implies Wi j > c + 1 for all j. As a result, 

E[x(i)] = E [l{e(i) = 1} /w(i)\ < (c + l)~ k p{e(i) = 1} = (c + 1 )~ k (p/n) s . 


Since [i] is connected, 1 < e([i]) < k and 2 < |[i]| < e([i]) + 1. For 2 < t < k, the number of 
i € C kn with |[i]| = t is less than (")t fc . As a result, for n > 2, 

k s 

X em«)] = xX X E[x(>)] 

*6 Ck,n- |[i]|<e([*]) 8=1 t=2 ieCfc, n : |[i]|=t,e([i])=s 

k s 

< JVi fc (c+l)-*(p/n) a 

8=1 t= 1 

k s k 

< (c + l)- fc J] s V Y S < 2(c + l)- fc sV- 

S=1 t=l S=1 

Since p and c are fixed, then by (13) 

EM**] = X E [x(i)] + Ofe(l), as n —>• oo. (14) 

ieCfc, n : |[i]|=e([i])+l 


Let i be a path counted on the right hand side of (14) and |[i]| = t. Then 2 < t < m + 1 and 

t / \ n((a,6),i) 


x(i) = 


£ * 


re U 


n 

a,6=1 


Clearly x(i) is a deterministic function of A n , denoted by F(A n ). Arrange the elements of V([i]) 
as Z \,..., Zt in the order of initial appearance in i and let <j(zi) = l. Then a : V([*D —>• {1,..., t} is 
the unique bijection such that a(i) := (<r(*i),..., (j(i k ), <t(*i)) € Extend a to a permutation 
of {1,... ,n}, still denoted a. Let S = ( Sij ) € A4 n with s t j = l{i = cr(j')}. From S'A n S = (iij) 
with Sij = Eh SkiSijSij = £a(i)<r(j), = e(c r (i))/u;(cj(*)) = F(S'A n S). Since A n ~ <S"A n S, 

then x(«) ~ x(a(i)), in particular, E[x(*)] = E[x(cr(i))]. 

It is easy to see that for each i € T^t, there are exactly n\/(n — t)\ paths counted on the right 
hand side of (14) that can be mapped in the above way to i. As a result, 


m +1 | 

E[tr(-flffi] = ^ Y E [ z (*)] + Ofc(l), n ^oo. (15) 

t =2 ^ ’’ ier ktt 

Given t = 2,.. ., m + 1 and i € r^, for a = 1,..., t, write 

Qa ^ ^ 1/a ^ ' &ei ^a ^ ^ ^e- 

aee€-B([i]) aGeeB t \£([i]) aSeSE n \E t 
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Then w a = c + q a + y a + S a . Put y = (yi,..., y t ). Then y, Si,..., S t , and e e , e € £([i]), are all 
independent, and x(i) / 0 4=k s e = 1 for all e € E([i]). Since e([i]) = t — 1, then 


E[x(i)] = E[x(i)l{e e = 1 Ve <E £([*])}] 


= (p/n) E 


£ e = lVeG £([*]) 


II (c + ga + y« + 5 a r n((a - ft) ’ <) 

a,6=1 

On the other hand, when e e = 1 for all e € E([*]), q a = d(a, [*]) for all a = 1,..., t. Then 

' t 

E[s(i)] = ( p/nf - 1 E I] ( c + d (°. W) + Va + S a )~ ELl n((a ’ 6)li) 

_a= 1 

" t 

= (p/n)*- 1 E n( c + d ( a ’ [*]) + 2/a + 


_a= 1 


Let n —>• oo. Since t is fixed, (y,Si, ■ ■ ■, St) —>• (0, ^i,..., in distribution, with ^ i.i.d. ~ Po(p). 
Then by (15) and dominated convergence, for k = 2m, 


m+l 


E[/3 fc (iL n )] = n -1 E[tr(iL^)] S E 

t=2 *er fcit 


nic+^a^tD+ear^^ 


,a=l 


finishing the proof. □ 
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